Reply to Veresoglou: Overdependence on "significance" testing in biology.
نویسندگان
چکیده
In PNAS, we explore the effects of interacting global change factors on the functioning of decomposer communities and show how biotic interactions influence the strength of soil carbon feedbacks to climate change (1). Veresoglou (2) highlights that the highly interactive nature of our multifactor experiment can increase the likelihood of type I errors (i.e., “false positives”), an effect that he refers to as “P hacking.”We appreciate this perspective because it provides a platform to discuss what we believe is a critical topic in biology: an overdependence on significant P values. P hacking refers to “data-dredging, snooping, fishing, significance-chasing and doubledipping” (3), essentially any instance of selectively searching for only significant P values. However, P hacking is not simply the reporting of a large number of P values. The multifactor experimental design necessitates the testing of multiple comparisons, but we focus on a specific set of preplanned hypotheses throughout. We discuss “significant” and “nonsignificant” relationships and, following convention, report only the relevant P values in the text. The significance values of all preplanned comparisons are presented in the accompanying figures. Of course, Veresoglou (2) is correct that reporting a large number of P values inflates the likelihood of unveiling significant (P < 0.05) effects. However, statistics is far from black and white. Adjusting P values to protect against type I errors (false positives) comes at a considerable cost: it inflates the probability of type II errors (false negatives) (4). When false negatives are considered to be as damaging as false positives, then adjusting for multiple comparisons is strongly discouraged (4, 5). That is, “scientists should not be so reluctant to explore leads that may turn out to be wrong that they penalize themselves by missing possibly important findings” (5). This is particularly important in complex ecological systems where mechanisms can be obscured by huge variability. Irrespective of the P values we report, many of the trends that we focus on will require further investigation, so a strict “accept/reject” framework would be counterproductive. Instead, we took great care to address a question from multiple angles using all available data and previous research. We then provide raw P values so that the reader can interpret relationships as they see fit [following Gelman et al. (4)]. We support Fisher’s original intention for the P value; it should be “just one part of a fluid, non-numerical process that blended data and background knowledge to lead to scientific conclusions” (3). The merit of our work does not hinge on the chance identification of some statistically significant effects that we selectively report (1). As highlighted by Veresoglou (2), the hypotheses we test are based on previous work and established theory. Accounting for multiple comparisons in our study would not change our findings. We encourage the reader to assess our work based on all of the evidence provided in the preceding literature, figures, and other statistical values, in addition to the reported P values. To conclude, we agree with Leek and Peng (6) that, “arguing about the P value is like focusing on a single misspelling, rather than on the faulty logic of a sentence.” Thomas W. Crowther, Daniel S. Maynard, Stephen M. Thomas, Petr Baldrian, Kristofer Covey, Serita D. Frey, Linda T. A. van Diepen, and Mark A. Bradford Yale School of Forestry and Environmental Studies, Yale University, New Haven, CT 06511; Department of Terrestrial Ecology, Netherlands Institute of Ecology (NIOOKNAW), 6700AM, Wageningen, The Netherlands; Department of Environmental Sciences, University of Helsinki, FI-00014 Helsinki, Finland; Laboratory of Environmental Microbiology, Institute of Microbiology of the Academy of Sciences of the Czech Republic, v.v.i., 14220 Prague 4, Czech Republic; and Department of Natural Resources and the Environment, University of New Hampshire, Durham, NH 03824
منابع مشابه
Oil Overdependence and Dutch Disease, KSA Evidence
In the context of recent oil prices fall and resulting budgetary problems faced by KSA, it is very interesting to examine the country’s overdependence on oil revenues. In this paper, we examine whether extreme oil dependence for Saudi Arabia is the symptom of Dutch Disease. We first provide a brief review of the literature on Dutch Disease and the natural resource curse. We then discuss the sym...
متن کاملAuthor’s Reply to “A Note on the Simulation of Gas Diffusion in Capillaries and Porous Solids”
متن کامل
Author’s Reply to “A Note on the Simulation of Gas Diffusion in Capillaries and Porous Solids”
متن کامل
Transfer from action to perception: The effect of motor-perceptual enrichment
This study investigated the effect of audiovisual integration on action-perception transfer.40 subjects were randomly divided four groups: visual, visual-auditory, control visual and control visual-auditory. Visual groups watched pattern skilled basketball player and other groups in addition to watching pattern skilled basketball player, heard Elbow angular velocity as sonification. In first st...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proceedings of the National Academy of Sciences of the United States of America
دوره 112 37 شماره
صفحات -
تاریخ انتشار 2015